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Abstract. Both the phenomenology and the theory of minority games (MG) with 
more than two strategies per agent are different from those of the conventional and 
extensively studied case S = 2. MGs with 5 > 2 exhibit nontrivial statistics of 
the frequencies with which the agents select from their available decision making 
strategies, with far-reaching implications. In the few theoretical MG studies with 5 > 2 
published so far, these statistics could not be calculated analytically. This prevented 
solution even in ergodic stationary states; equations for order parameters could only be 
closed approximately, using simulation data. Here we carry out a generating functional 
analysis of fake history batch MGs with arbitary values of S, and give an analytical 
solution of the strategy frequency problem. This leads to closed equations for order 
parameters in the ergodic regime, exact expressions for strategy selection statistics, 
and phase diagrams. Our results find perfect confirmation in numerical simulations. 
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1. Introduction 

Minority games |2] were proposed as simple models with which to increase our 
understanding of the origin of the observed nontrivial dynamics of markets; these 
dynamics are believed to result from an interplay of cooperation, competition and 
adaptation of interacting agents. Minority games were indeed found to show intriguing 
nontrivial relations between observables such as the market volatility, overall bid 
correlations and sensitivity to market perturbations, and the information available for 
agents to act upon. Nevertheless, over the years we have found that they can to a 
large extent be solved analytically using equilibrium and non-equilibrium statistical 
mechanical techniques (see e.g. the recent textbooks and references therein). This 
desirable combination of complexity and solvability is their beauty and appeal. 

There have been many advances in the mathematical study of MGs, but nearly 
all calculations have so far been carried out for S = 2, where S denotes the number 
of trading strategies available to each agent. One typically finds phase transitions 
separating an ergodic from a non-ergodic regime, which for S = 2 can be located exactly. 
For S* > 2, in contrast, no such exact results are available. Although mathematical 
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theory has been developed along the lines of the S = 2 case El El IE] , the authors of 
the latter studies ultimately ran into the nontrivial problem of calculating the statistics 
of agents' strategy selections, a problem which resisted analytical solution. In this paper 
we generalize the theorists' toolbox: we develop a generating functional analysis ,9j for 
MGs with arbitrary finite values of S, and show how to derive exact closed equations for 
observables and predict phase transition lines. We will do so for the simplest so-called 
batch version of the MG jlO| lllj. with fake (i.e. purely random) external information 

Our paper is organized as follows. We start with model definitions and review the 
phenomenology of MGs with S* > 2 as observed in simulations, focusing on the crucial 
differences with the more familiar 5 = 2 case and on the core problem of determining 
the statistics of strategy selection frequencies. The next step is a formal generating 
functional analysis (of which most details are relegated to an appendix), leading as 
usual to an effective single agent process, here describing the evolution of S strategy 
valuations. We then state the strategy selection frequency problem in the language of 
the effective agent process, and solve it. The result is an exact (but nontrivial) set of 
closed equations for the relevant observables in MGs with arbitrary S in the ergodic 
phase, and predictions for the location of the phase transition (where ergodicity breaks 
down). S" = 3 is the simplest nontrivial situation to which our new theory applies, so 
we present extensive applications to S* = 3 and test each prediction against numerical 
simulations. This is followed by some further (more limited) applications to 5 = 4 and 
S = 5 MGs, again verified via simulation experiments, and predictions regarding market 
volatihty and predictability. Although the theory is initially set up for arbitrary types 
and levels of decision noise, we will concentrate in this paper mostly on MGs with either 
additive or absent decision noise. We end our paper with a discussion of our results and 
their implications. 

2. Definitions 

The MG describes agents in a market, labeled hj i = 1 . . . N. Each agent i is 
required to submit a bid 6j(£) G { — 1,1} (e.g. 'sell' or 'buy') to this market, at each 
round t E {0, 1, 2, . . .} of the game, in response to public information which is distributed 
to all agents. Those who subsequently find themselves in the minority group, i.e. those 
i for which bi{t)[J2jbj{t)] < (who sell when most wish to buy, or buy when most 
wish to sell), make profit. In order to be successful in the game, agents must therefore 
anticipate how their competitors are likely to respond to the public information. 

The mathematical implementation of the game is as follows. The public information 
at time t (e.g. the state of the market) is represented by an integer number fi(t) e 
{1, . . . ,p}. Each agent i has S private strategies G {—1, 1}^ at his disposal, labeled 
by a = 1 ... S*, with which to convert the observed information into a binary bid. A 
strategy is a look-up table giving prescribed bids for each of the p possible states of 
the market: upon observing /i(t) = /x at time t, strategy a of agent i would prescribe 
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submitting the bid = i?^". All pNS entries G {—1,1} are drawn randomly and 
independently at the start of the game, with equal probabilities. The strategy used by 
agent i at time t is called his 'active strategy', and denoted by ai(t) G {1, . . . , S}. Given 
the active strategies of all agents, their collective responses are fully deterministic; the 
dynamics of the MG evolves around the evolution of the N active strategies ai{t). 

Agents in the MG select their active strategies on the basis of strategy valuations 
Viait), which indicate how often each strategy would have been profitable if it had been 
played from the start of the game. In the so-called batch version of the MG these 
valuations are continually updated following 

V^ait + 1) = vUt) + 0^a{t) " ^ E A,{t)ir; (l) 

V A 

Here A^^it) = N^^^^ J2j Mj^^^^^ is the re-scaled overall market bid at time t that would 
be observed upon presentation of external information fi, fj (the 'learning rate') is a 
parameter that controls the characteristics time scales of the process, and 6ia{t) denotes 
a (small) perturbation that enables us to define response functions later. We abbreviate 
v*(t) = {vii{t), . . . ,Vis{t)) G IR*^. The active strategies are now determined at each time 
t and for each agent i by a function m : IR'^ {1, . . . , 5} that promotes strategies with 
large valuations, but may allow for a degree of randomness. Typical choices are 

deterministic : m(v) = argmax^[fa] (2) 

additive noise : m(v) = argmax^[t>a + Tza{i)] (3) 

multiplicative noise : m(v) = r(£) argmax^[t>a] + [1 — r{i)]a{i) (4) 

Here Za{t) is a zero- average and unit- variance random variable, drawn independently for 
each (a, t), T > is a parameter to control the randomness in Q, r(t) G {0, 1} is drawn 
randomly and independently for each i from some distribution P(r), and a{t) is drawn 
randomly and independently for each t from {1, . . . , 5} (with equal probabilities). We 
recover the deterministic case (j2]) by taking the limit T ^ in 0, or P(r) — 5(r — 1) 
in (0]). In order to compactify subsequent equations we will write all random variables 
at time t in ()3l4j) . which will be drawn separately and independently for each of the A^ 
agents, as z(t). This allows us to write generally aj(t) = m(v*(t), z*(t)). The (stochastic) 
equations describing the MG with arbitrary 5* can therefore be written as 

~ p N 

vUt + 1) = vut) + eut) - ^ E E B^B^^'^'^"^^ (5) 

Alternatively we can write the process (0) in probabilistic form, i.e. as an evolution 
equation for the probability density P(y^, . . . , v^) of the A^ strategy valuation vectors. 
Upon abbreviating {z} = (z^, . . . , z^) and {v} = (v^, . . . , v^) this can be written as 



^m({v}) = /4v'} W^.({v};{v'})P,({v'}) (6) 

A^ 



{z} 
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Averages over the process (j6l7|) are written as (...)• The market fluctuations in the MG 
are characterized by the average {Ait)) = J2|l{Af^(t)) , which one expects to be zero, 
and the covariance kernel Eu' = J2fi{[Afi{t) — {A(t))] [A^(t') — {A(t'))]). In particular, 
the volatility a is defined by = lim^^oo J2t=i '^tt, and the direct predictability is 

measured by H = p"^ E;.[lim,-oo ELiI^mW - (Mt)))? = lim^^oo r'^ E?t'=i S^'- 
For detailed discussions of the relations between the above and various alternative MG 
versions we refer to 0111 • 

3. Phenomenology of MGs with arbitrary values of S 

Before diving into theory it is perhaps helpful to describe first the phenomenology 
of MGs with 5* > 2, as observed in numerical simulations, with emphasis on those 
aspects in which they are distinct from S = 2 MGs. All simulations of which results 
are shown in the remainder of this paper involved MGs with N = 4097 agents and 
no decision noise, and stationary state measurements were taken either over the time 
interval 500 < t < 3000 (for a < 32) or over 100 < t < 1100 (for a > 32). 

In early (numerical and theoretical) studies on MGs with 5* > 2 the emphasis was 
often on the behaviour of the volatility. Its dependence on a was found to be very similar 
for different values of S, but with the phase transition point adS) (that separates a 
nonergodic regime at small values of a from an ergodic regime for large a) increasing 
with increasing values of S. This latter dependence was conjectured in P to be roughly 
a{S) ~ ac(2) + ^(5* — 2). The fundamental differences between 5 = 2 MGs and S > 2 
MGs become clear as soon as one tries to go beyond measuring observables derived from 
the overall market bid (like the volatility), but turns to quantities such as the fraction 
of 'frozen' agents or the long-time correlations. For 5* > 2 it is no longer obvious how 
such objects must be defined. Some agents are found to play just one strategy, some 
play two, some play three, etc.; for 5 > 2 agents can apparently be 'frozen' to various 
extents, which cannot be captured by a single number (j) (the fraction of frozen agents 
for S = 2, see e.g. [Sill])- Similarly, it is not a priori clear which variables should be 
measured to define correlation functions. 

The dynamics of MGs is about the selection of active strategies aj(t) G {1, . . . , S*} 
by the agents, so let us observe in simulations how agents select strategies for 5* > 2 
in the stationary state. We define the frequencies = \imr->oo J2t<T ^a,ai{t), where 
/* measures the fraction of time during which agent i played strategy a. Each vector 
fj = (/{, . . . , fg) is a point in the {S — l)-dimensional plane Ef=i /a = 1 [0; l]*^- The 
collection of these N points gives a view on the collective stationary state of the system. 
For S' = 3 the vectors fj can be plotted directly as points in [0, 1]^, giving rise to figures 
such as fig. n We can extract relevant information from these graphs. For large a the 
agents tend to involve all three strategies, but not with identical frequencies (otherwise 
one would have seen f, = (|, |, |) for all i). As a is reduced, the vectors f, tend to 
concentrate on the borders of the plane Ef=i /a = 1 [O5 1]"^' which is where one of 
the /* equals zero, implying agents who play only two of their strategies, but again not 
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Figure 1. Each plot shows the frequency vectors ii — {fl, /2, /J) with which the agents 
use their three strategies, drawn for each agent i as a point in [0, 1]^ (giving N points 
per plot), as obtained from numerical simulations of an MG without decision noise with 
N = 4097 and S' = 3. The constraint J2a fa = ^ fo^' * implies that all points are in 
the plane that goes through the three corners {(1, 0, 0), (0, 1, 0), (0, 0, 1)} (each corner 
represents agents 'frozen' into using a single strategy). Left plots: unbiased initial 
conditions (random initial strategy valuations drawn from [—10^'', 10^^]). Right plots: 
biased initial conditions (random initial strategy valuations drawn from [—10, 10]). 

with identical frequencies. Upon reducing a further we enter the nonergodic regime 
(which explains the difference between the two graphs for a = 1/64), with points either 
concentrating in the corners {(1, 0, 0), (0, 1, 0), (0, 0, 1)} (for biased initialization) or on 
more constrained subsets where two strategies are played in very specific combinations 
(for unbiased initialization). If we measure the distribution g{fa) = J2i^{fa — fa) 
also obtain quantitative information on the density of points in various regions of [0, 1]^, 
complementing the information in figure ^ Typical examples are shown in figure El for 
different values of a; note that the symmetry of the problem guarantees that for N oo 
all S distributions g{fa) must be identical (this is confirmed in simulations). Temporal 
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Figure 2. Histograms of the fraction g{fi) of agents that play strategy 1 with 
frequency /i in the stationary state (observed in simulations with N = 4097 and 
S = 3), for a G {1/128, 1/64, 32, 64} (increasing by a factor 2 at each step). 
Left: unbiased initial conditions (random initial strategy valuations drawn from 
[— 10"'*, 10~^]). Right: biased initial conditions (random initial strategy valuations 
drawn from [—10, 10]). 
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Figure 3. Evolution in time of active strategy selections for the first 41 out of the 
total number N — 4097 of agents in an MG with S = 3, following unbiased initial 
conditions. The chosen strategies are indicated by grey levels: white means ai{t) = 1, 
grey means ai{t) = 2, and black means ai{t) = 3. Left graph: a = 1/128 (in the 
nonergodic regime). Right graph: a = 8 (in the ergodic regime). 



information on how such macroscopic states are reahzed can be obtained by showing the 
variables ai{t) as functions of time. For S — 3 and unbiased initial conditions this has 
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Figure 4. Each plot shows the N valuation velocity vectors {v\,v^2j^3) points 
in TR^ , obtained from numerical simulations of an MG without decision noise, with 
N = 4097 and 5 = 3. Left plots: unbiased initial conditions (random initial strategy 
valuations drawn from [—10"'*, 10"'']). Right plots: biased initial conditions (random 
initial strategy valuations drawn from [—10,10]). Note the vastly different scales in 
the three graphs (increasing from top to bottom). 

been done using grey-scale coding in figure El showing the difference between the small 
a regime (left), where agents tend to alternate two of their strategies equally, and the 
large a regime, where agents tend to involve all three strategies, at non-uniform rates. 

Strategy selections are made on the basis of strategy valuations, which in MGs are 
known to grow potentially linearly with time. Only those strategies with the largest 
growth rates will be played. If one measures the growth rates (or 'strategy velocities') 
of the valuations, tJ^ = T'^{v^^{t + r) — f*(t)), in the stationary state (i.e. for large r 
and t) for S = 3, and subsequently plots the velocity vectors {v\,V2,vl) as points 
in IR^, one obtains graphs as in figure |^ For small a all velocities are concentrated 
very close to the origin; as a is increased they become consistently more negative, with 
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Figure 5. The plots show the first three components of the frequency vectors 
fi — (/ii /2: /a J /I) with which the agents use their three strategies, drawn for each agent 
i as a point in [0, 1]'^ (giving TV points per plot), obtained from numerical simulations 
of an MG without decision noise, with = 4097 and S = A. The constraint J2a fa = ^ 
for all i now implies that all points are in a the hyper-plane of which in the present 
graph one sees a 3-dimensional projection. Left plots: unbiased initial conditions 
(random initial strategy valuations drawn from [-10-'', lO-'^]). Right plots: biased 
initial conditions (random initial strategy valuations drawn from [—10, 10]). 

concentration of points with two identical components at intermediate a (consistent 
with agents playing two strategies only) and concentration of points along the diagonal 
where all three components are identical for large a (consistent with agents playing all 
three strategies). 

For 3 = 4 one can obviously no longer plot the frequency vectors f; = (/{, /g, f^, fl) 
as points in [0, 1]^, but one has to resort to projection from 4D to 3D: we plot in figure 
El only the first three components {fl.fl^fD- The corresponding strategy frequency 
distributions are shown in figure El Once more we observe the ergodicity/nonergodicity 
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Figure 6. Histograms of the fraction Q{fi) of agents that play strategy 1 with 
frequency / in the stationary state (observed in simulations with N = 4097 and 
S — 4), for a G {1/128, 1/64, 32, 64} (increasing by a factor 2 at each step). 
Left: unbiased initial conditions (random initial strategy valuations drawn from 
[—10"'*, 10"'*]). Right: biased initial conditions (random initial strategy valuations 
drawn from [—10, 10]). 

phase transition (here occurring for a larger value of a than was the case at 5 = 3), and 
the tendency at small a for agents to play only a specific subset of their four strategies. 
For unbiased initial conditions the agents are seen to play in the nonergodic regime 
always either one or two of their strategies; this curious tendency, for which there is no 
immediate obvious explanation, is found also for larger values of S (we have confirmed 
this for values up to 5 = 6). Plotting the first three components of the valuation 
velocity vectors (l^,U2,I^, tJ^) leads to the graphs shown in figure [7| As with S* = 3 
we see the agents playing all 5* strategies for large a, but selecting specific subsets for 
smaller a, generally with nontrivial frequencies. Here there are more options than at 
5* = 3 for doing so: agents can and will go for either one, two, three or four strategies. 

If we finally measure and plot the distribution gijJa) = J2i ^{va — vD of the N 
strategy velocities for strategy a, we see clearly a consistent tendency of the strategy 
velocities to shift towards negative values as a increases (which indicates an increasing 
inability of the agents to be successful in the game). Examples are shown in figure |H1 at 
different values of a, for S = 3 and 5 = 4, both following unbiased initial conditions. 
The symmetry of the problem guarantees that for N ^ oo the distribution g{va) will 
be the same for all a (although it will obviously depend on S). 

It is clear from these figures that the complexities of MGs with S > 2 are in the 
nontrivial dependence on control parameters and initial conditions of the frequencies 
with which the agents use their available strategies. Mathematically one finds this 
reflected in a nontrivial closure problem, as we will see below. 
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a = 1/64 






Figure 7. Each plot shows the first three components {vi,V2,v^3) of the N valuation 
velocity vectors as points in M^, obtained from numerical simulations of an MG without 
decision noise, with TV = 4097 and S = A. Left plots: unbiased initial conditions 
(random initial strategy valuations drawn from [-10"'*, 10-"]). Right plots: biased 
initial conditions (random initial strategy valuations drawn from [—10, 10]). Note the 
different scales in the three graphs (increasing from top to bottom). 



4. Generating functional analysis for general 5" 

In solving the dynamics of the process (j6l7p for general S we can follow the strategy of 
the S = 2 case (see e.g. ^01 E] or jlj), although we no longer benefit from simplifications 
such as a reduction to equations and observables with valuation differences only. We will 
therefore suppress details and give only relevant intermediate stages of the calculation. 
For general S the canonical disorder-averaged:}: moment generating functional will be 

X As always, the pNS strategy entries are regarded as frozen disorder, and disorder averages are 
written as (...). 
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Figure 8. Histograms of tlic fraction q{vi) of agents tliat liave strategy velocity vi 
for strategy 1 in the stationary state (observed in simulations with N = 4097), for 
a £ {1/128, 1/64, 32, 64} (increasing by a factor 2 at each step). Left: S = 3. 
Right: S ~ 4:. All measurements were taken following unbiased initial conditions 
(random initial strategy valuations drawn from [—10^^, 10^*]). 



where aj(t) = m(v*(t), z*(t)) is the active strategy of agent i at time t. It generates 
dynamical observables such as the correlation- and response functions 

1 

N 
1 



(9) 



. 1 ^ d^zg^] 



(10) 



Ctt' gives the likelihood that the active strategies at times t and t' are identical; Gu' 
measures the increase in probability of a strategy being active at time t following a 
perturbation of its valuation at time t' (both averaged over all agents and over the 
disorder). Causality guarantees that Gu' = for t < t'. We note that random strategy 
selection by the agents would give Cu' = S^^ + 6tt'{l — S^^) and Gu' = 0. A fully frozen 
state would be characterized by Gu' = 1 and Gu' = 0. 

The functional (jHl) is an average over all possible paths of the combined A^-agent 
state vector {v} in time. The probability density for each path is a product of the 
kernels Wt{. . ■) in ((Tj), so upon writing the (5-functions in ((Tj) in integral form one finds 



n 



X ( e 



(2vr) 



Po(v\...,v^) e'^^aA 



(t)[vl{t+l)-vlit)-9,a{t)] 



m(v»{t),z«{t)) 



i(r}/N)Y.. . Ri'^Ri'' vi(t)5 , , ,,,, ,,,,, 



'{z} 



The dependence on strategy entries in the exponent is linearized by introducing auxiliary 
variables = vN'^/^ J2iavi{t)K^^ (via suitable 5-functions) , after which the disorder 
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average is carried out. In leading order in N, the result depends on the {v} only via 



z}) — — Sa^rn(vHt).zUt))^a,m(vHt'),zHt')) 
ia 

ia 



(12) 
(13) 
(14) 



Upon isolating (jl2ll3|14p via integrals over suitable 5-functions we can factorize and 
integrate out the site-dependent variables, given site-factorized initial conditions. We 
then find an expression for Z[i/)] which can for iV — > oo be evaluated by steepest descent: 

^Af(<I'+<I>+n)+C'(logAr) 



Yl dCtt'dCtt'dKu'dKtt'dLtt'dLtt' 



Ltt' 



[^tt'Ctf + Ku'Ktt' + Ltt'Lw 



(15) 
(16) 



a log 
1 



-j-j- dx^t4Xt_^ixtxt 



27r 



N 



n 

at 



dVa{t)dVa{t) 

2-K 



^o(v(0)) 



For N ^ oo the order parameters {C, C*, K, L, L} are determined by the saddle-point 
equations of the exponent \& -(- $ -f Working out these equations is straightforward 
but lengthy, see [Appendix A The resulting theory can be written solely in terms of the 
kernels f|9ll0|) . and, upon choosing 9ia{t) = 9a(t), formulated in terms of the following 
stochastic process for the strategy valuations v = (t>i, . . . , vs) of a single 'effective agent', 
with zero-average coloured Gaussian noise forces {rja} and a retarded self- interaction: 

Va{t + 1) = Va{t) + Bait) - Rtt'Sa,m(v(t')Mt')) + V« Vait) (19) 

t'<t 

Rw = fj{l + fjG)-} {Vait)vbit')) = 6abiRCR%, (20) 
In terms of averages over the process (|19l2(jp . the kernels C and G must be solved from 

- 2^\^Ja,m(v(t),z(i))0a,m(v(i'),z{t'))/ " f)n ^^^1 

a a ) 

Finally, in a similar manner one derives exact expressions, in the limit ^ oo, 
for the disorder-averaged bid covariance matrix S^f/ and also for the volatility and 
predictability a and H (which both should be self-averaging for A^ oo). Here the 
appropriate generating functional is 



(22) 
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From (|22j). which is calculated from the result of evaluating the previous functional Z[tjj] 
upon making simple substitutions (see Appendix B for details), one obtains 



dZ[^] 



lim (AJt)) = - i lim lim ^ , V = (23) 



lim - Y iAJt)AJt')) = - lim -V lim ^, ^ 51^^ , 

= r\RCR%' (24) 
It follows that in the limit — > oo we must have = ■^{RCR^)u' , and hence 

a' = }^ - Y,{RCR% H = 1 j2 iRCR%, (25) 

r^oo T T^oo J ^^^^^ 

5. Stationary states: the strategy frequency problem 

To find time-translation invariant stationary solutions of our MG without anomalous 
response and with weak long-term memory memory we put Cu' = C{t — t') and 
Gtt' = G(t — t'), and we define the usual static order parameters, viz. the persistent 
correlations c = limt_,ooC(^) and the static susceptibility x = J2t>o^(J^)- Consequently 
also Rtt' = R{t — t') and 

We extract from ()19l20p an equation for the valuation growth rates Va = lim^^oo Va{t)/t. 
This involves the 'frozen' Gaussian fields r/^ = \imt-,oot^^ J2t'<tVa{t'), the persistent 
perturbations 9a = ^i^t-^oot^^J^t'Kt^^ait') and the strategy selection frequencies fa- 

Va = 0a + Var]^ - ax^fa fa = lim ^ Xl(^«:™(vW,z))z (27) 

s<t 

Since (rj^) = and (r7„%) = 5abcxl, we write rJa = XuVcXa, where x = {xi,...,xs) 
is a vector of S uncorrelated zero-average and unit-variance frozen Gaussian variables. 
So are 2S relations for the 25" unknown variables (yi, . . . ,vs) and (/i,...,/s), 
parametrized by x. Elimination of the valuation growth rates would give a formula for 
/a(x), with a = 1 ... 5*. To emphasize this structure of our problem we write ()27j) as 

/a(x) = Xa\[^ + — ^^^^ /a(x) = lim J V(5a,m(v(s,x).z))z (28) 

V a axa ^ ^ 

One easily derives from ()21|) closed equations for the static order parameters, via 
c = limt^^t~'^Y.ss'<tC{s - s') and x = limt-.oo t"^ Es<t ^('^a,m(v(s),z(s)))/5^a, in terms 
of the solution {/^(x)} of The equation for x can be simplified further upon noting 
that 8/ 89 a = a^^/'^8/8fj^ = (Xj^yac)"^9/9xa. The result is 

C= fo^Yfai^) X = ^^ (D^Y^afai^) (29) 
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(with the usual short-hand Dx = (27r) 2^ dx). The perturbation fields 9a are now 
no longer needed and can be set to zero, which simplifies (j^Hj) to 

/a(x) = Xa\[^ - ^^^^ /„(x) = lim J V(5a,m(v(s,x),z))z (30) 

V a t 

Solving the stationary state of the MG, including finding the phase transition line 
marked by a divergence of x, thus boils down to solving {/^(x)} from (p?n|) . This is 
the strategy frequency problem. The difficulty is in the second part of ()30|): even if 
two strategies (a, 6) have I'a(x) = ^'^(x) it does not follow that /a(x) = /^(x) (this 
is also clear in simulations). All valuation growth rates Vc(x), including those with 
c ^ {a, b}, will infiuence /a(x) and /^(x). The frequencies depend in a highly nontrivial 
way on both the realization of the Gaussian vector x (which represents the diversity 
in the original A^-agent population) and the control parameter a. Even the transients 
of the valuations, i.e. the full Va{s,x.) rather than just their growth rates I'a(x), could 
in principle impact on the long-term frequencies {/^(x)}. All this appears to make the 
problem practically insoluble. For S = 2 the situation could be saved upon translation 
of our equations into the language of valuation differences; there was only one relevant 
quantity, q = Vi — V2, and what mattered was only whether or not q = 0. For S > 2 
this is no longer true. 



6. Solution of the strategy frequency problem 

We turn to the general solution of the strategy frequency problem, for arbitrary S and 
additive§ decision noise: m(v, z) = argmax^[fa + Tza]- This includes the deterministic 
case for T = 0. We define I7*(x) = max;, iJfc(x), and the set A(x) of all strategy indices 
for which Vai'x.) = v*{x.): 

A(x) = {a| l;„(x) =maxUfe(x)} C |A(x)| > (31) 

The solution now proceeds in three stages: 

• Since Va{s,x.) = s[Ua(x) + £:a(s,x)] with lim<j^oo £^a(s, x) = we may write for the 
second equation in (PHj) 

T(z-z' 



/a(x) = hm (|]^ (^e[va{:X.) -t^fe(x) +6a{s,X.) - ^^(s, x) + 



This quantity can be nonzero only for a G A(x). Hence, once we know A(x) and 
v*{x.) the problem is solved, since in combination with ()28|1 we may write 

a ^ A(x) : /a(x) = 0, tJa(x) = XaXnV^ (32) 



aGA(x): fa{^)=xj ^, U,(x)=U*(x) (33) 

V a 

§ The solution in the case of muhiphcative decision noise is not identical but very similar to the one 
discussed here; see [Appendix D| for details. 
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Next we calculate tJ*(x). Probability normalization guarantees that I]a/a(x) = 1 
for any x, but since /a(x) 7^ only for a G A(x) we have in fact J2a&A{x) /a(x) = 1. 
Summing over the indices in therefore leads to 



Upon abbreviating |A(x)|^^ J2beA{x) Uh = (f^)A(x) our equations ()32I33|) then become 
A(x) : /a(x) = 0, iJa(x) = XaXnV^^ (35) 

aGA(x): /,(x) = ^^ + y|(a;„-(x)A(x)), iJ,(x) = ^(x) (36) 

What remains is to determine the set A(x). The definition (jHTj) of A(x) demands 
that Va{^) < t'*(x) for all a ^ A(x), i.e. 

[a 1 

a ^ A(x) : Xa < (x)a(x) - y ~JX(^\ (37) 
However, we have similar inequalities for a G A(x), as (jHBj) must obey /a(x) G [0, 1]: 
a G A(x) : > (x)a(x) - /^^^^ (38) 

^« < (^)a(x) - /f + /f (39) 

These last three groups of inequalities turn out to determine the set A(x) uniquely. 
Firstly, they tell us that A(x) must contain the indices of the i largest components 
of the vector x, where i = |A(x)|. With probability one we may assume all 
components of x to be different, so for each x there is a unique permutation 
TTx : {1, . . . , 5} ^ {1, . . . , 5} for which these components will be ordered according 
to Xn{i) > Xn{2) > ■ ■ ■ > a;,r(5)- We may now translate our key inequalities into the 
following three conditions, where £ = |A(x)| G {1, . . . , S*} is the only quantity left 
to be solved, and with the ordering permutation vr = tTxI 

1 1 /a 



x^H) > E ^Arn) -JZiy- if ^ > 1 (40) 

1 ^ 1 /cF 

Xn{e+i) < 7 E ^^im) ~ -n\ - H £ < S (41) 



m=l ' ^ 

1 ^ 1 fa [a 

The solution is the following: i is the smallest number in {1, . . . , S*} for which ()41|] 
holds (if any), whereas if (PT|) never holds then i = S. By construction we thereby 
satisfy both pOjl and (jHj). What remains is to show that also (jl^ will be satisfied, 
and that the solution is unique. Clearly ()42|) holds when i = 1. To prove for 
e > 1 we first define X)^ — k ^ ^2m<k •^7r(m) 

for k < i. We know from pn|l that 

Y i 

k J / y •^7r(m) ~l~ , "''7r(A;) 
^ m=l ^ 
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1 1 1 /a 1 fa 



Hk-l)^^^""' k{k-l)\ c k{k-l)\ c 



(m) — TJ-j 7T\I — — Xk-l 



Hence X, > Xi - EL=2 ^^^^ = - ^ EL=2 so that 

I' fa'^ 1 I' fai-l 



This is the inequahty (j42p that we set out to prove. The corollary is that we have 
indeed defined a self-consistent solution of our equations. The above solution must 
be unique: any alternative choice of i (rather than the smallest) that satisfies 1)411] 
would always make the previous (smallest) choice induce a violation of (PUj). 

We may now summarize the solution of the strategy frequency problem for additive 
decision noise as follows: 

a i A(x) : A(x) = (43) 

a e A(x) : /,(x) = ^ + \/f (^'^- (^)a(x)) (44) 

A(x) ={7r.(l),...,7rx(£(x))} (45) 

permutation such that ^^^(i) > x^(2) > . . . > x,r(5) (46) 

(x) : defined by the conditions (47) 

1 ^ 1 fa 

< - E ^^'tM - -7 J- if^<S' (48) 

^ m=l t V C 

1 1 /a 



> T E ^'tM ~ T.\I a\lk <i (49) 

The meaning of this stationary state solution is as follows. The randomness induced by 
the Gaussian variables x represents the variability in the original X-agent population. 
The set A(x) contains the strategies that will be played by the effective agent, albeit 
with different frequencies. Strategies a ^ A(x) are never played. Apparently, for a 
strategy a to be in the 'active' set A(x), it must have an Xa that is sufficiently large, 
and sufficiently close to the components of the other strategies in the active set. 

Since expressions (I43|44j) depend only on A(x), i.e. on tTx and i (see above), we 
may abbreviate these formulae as /a(x|7rx,^). Averages of the form /L'x ^({xa, /a(x)) 
can now be written as a sum over all permutations vr of {1, . . . , S"}, with a function 
C(7r|x) = S-j^^Tj^ that selects the right component ordering permutation for each x: 

/ Dx $(K, /,(x)}) = E E /^^ C'(7r|x) <|.(K, /.(xItt, £)}) 

1 A I fa 

X 



X 



i^Ses + (1 - Ses) ^["^ E ^-^M - J\l~. ~ | 
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C(7r|x) = Yl 9[x^(^a) 



) ^n(a+l)\ 
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(51) 



a=l 



7. The static theory for arbitrary S 

In those cases where one seeks to calculate the average of a function $ that is invariant 
under all permutations of the index set {1, . . . , S"}, as in ()29|). one may use this invariance 
to simplify the calculation. Here the average will equal the contribution from one 
particular ordering of the components of x (and its associated permutation tTx, for 
which we may take the identity permutation) times the number S\ of permutations: 

/ DX $(K, /,(X)}) =S\j2 f D^I[0[Xa - Xa+lM{Xa, /a(x|£)}) 

e=i •' a=l 



X 



a<e 

X Jfe+(1- 



a 



m=l 



a \ c 



Xr, 



m=l 



1 fa 



(52) 



In particular, upon application to (j29p and using (j43l44j) we have now arrived at fully 
explicit and closed equations for our static order parameters: 



•^'EE /^^ Y[0[Xa-Xa+l] U^^i^a 



1 A 1 fa 

+1 Xm~\ \ 



■ a=l " 
X i + (1 - bts 



a 



m=l 



a \ c 



X 



1 



VX 
l+fjx 



: + \l-{Xa- 7 E 

m=l 

Q] s e „ s-1 

EE n^[^^ 



(53) 



£=1 a=l 



a=l 



Xa+l] n E ^™ 



m=l 



1 fa 
a \ c 



(54) 



m=l 



For any given value of a one first solves ()53p for c, after which x is calculated via (jMj) . 
The X = oo phase transition occurs when the right-hand side of (jH^ equals one, and 
defines the critical point ac{S). Numerical solution of ()53I54|) for S G {2,3,4,5} gives 
the following values (accurate up to the last digit given): 



s 


2 


3 


4 


5 


ac{S) 


0.337 


0.824 


1.324 


1.822 
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These values are remarkably close to those one would predict on the basis of the heuristic 
relation a{S) ~ ac(2) + ^(S* — 2), as proposed in [H], but not identical. 

Below the critical point, in the nonergodic regime a < adS) the above theory 
no longer applies, as its assumption of finite integrated response x is violated. For 
S" = 2 it was shown [TUl E] that upon replacing by the equation = 0, where 
X is calculated from ()54p. the agreement between ergodic theory and simulations with 
biased initializations in the nonergodic regime could be improved. For S > 2 such 
heuristic improvements are again possible but more awkward. Furthermore, they are 
entirely ad hoc and artificial, and therefore mostly of cosmetic merit, so we have decided 
to restrict ourselves here to the theory (j53l54|) . 

For arbitrary S we must also generalize the concept of 'frozen agents' and the 
associated order parameter 0: we define (pk as the fraction of agents that in the 
stationary state play k of their S strategies. Since k is simply the size of the active 
set A(x) in the language of ()52j) the order parameter 0^ is the average of the function 
/a(x|£)}) = 6ke, so our solution immediately tells us that 



5-1 



Y[ e[Xa - Xa+l] Yl 0[xa - - Xm + - 



a=l 



a=l 



a 



m=l 



X 



1 ^ 



m=l 



1 fa 
i y c 



Jd^YI e[Xa - Xa+l] n ^^-^ ~ " 

a=l a=l ^ m=l 



1 

m ~l~ 1 

a 



(55) 
(56) 



One confirms easily that these expressions obey Y.'k=i 'Pk = 1, as they should. Our 
formulae ()55|56|) can be simplified, if needed, by generating and exploiting permutation- 
invariant terms. For instance, 0i and 02 can be rewritten as 

s 



\x. 



m—l 



Xri 



s\ 



! / Dx e[xi -X2- J^] n ^l- 
S\ j Dx. 9[xi — J— — maxxa] Y\_ d[xm-i 

^ m=3 

J Dx. 9[xi — — maxxa] 

11 / — ^ 

1 1 X la\ 

S\ J Dx 9[x2- xi+ ^]0[xi+ X2- 2x3- 

s\ J Dx e[xi- X2]e[x2-xi+ ^]e[. 

fa ^ 

X 9\xi+ X2— 2maxXa— \ —] TT 0\xm-i 

a>2 \/ -L-L 



Xtj 



is -1)1 



S Dx 



(57) 



5 

m=2 



Xr. 



-1 x^, 



maxxa 

a>2 



Xr, 



m=4 
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y y"-Dx 6[xi- X2]0[x2- xi+ ^J^]0[x2- maxxa] 



(S-2) 

X 9\xi+ X2— 2maxXa— \ —] 

a>2 \l c 

S{S-1) jDxDye[^+y-x]e[x-y] 
J Dz e[y - z]e[x + y-2z-^ 



5-2 



2x — u 1 fa 



2 2 V 2 V 2c 



5-2 



(5^ 



We can now also calculate the disorder-averaged strategy frequency distribution 
g{f) = liniAr^oo J2i {^[f — fail), which gives the fraction of agents in the stationary 
state that use strategy a with frequency /. The problem is strategy permutation 
invariant, so g{f) cannot depend on a. We may therefore write it in the permutation- 
invariant form g{f) = limAr_,oo(5'A^)^^ J2ia {^[f " fai]), and calculate it by applying our 
formula §2) to the function <l>({xa, /a(x|£)}) = ^"^ Ef=i 5[/ - /a(x|£)]. This gives, 
upon using our above formulae for the 0^: 

S 5 5—1 

g{f) = (5- 1)! E E /^^ ^[/ - n - ^a+i] 



e=i b=i ■ 

n ^ 



a=l 



1 A 1 /a 



a<i 



m=l 



a \ c 



Ides + (1 - Ses) E 



WEV-|)</>. + 5(/- 1)^01 

5 ^ 

+ (5-i)!EE 

£=2 6=1 

5-1 



S 
1 



Xr. 



m=l 



X 



1 " 1 Hy 

n 6'[Xa - Xa+l] n Olxa+l E ^"^ + ~\/ ~ 



a=l 



a=l 



m=l 



a \ c 



X 



|fe + (i-fe)4^E^™-^/f- 



(59) 



One can recover the full generating functional theory of the S = 2 batch MG as in e.g. 
[Tin ITT] from the above more general equations, as a test (taking into account carefully 
the different definitions of correlation and response functions that were made in earlier 
studies). We will not give details of this in principle straightforward exercise here. 

Finally, in the limit a ^ oo our theory predicts that the MG will behave for any 
S as if the agents were to select strategies completely randomly, as one expects. Here, 
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upon using S\ J2a=i /-^^ nf=i (^[^a—Xa+i] = 1 one finds that our equations simplify to 



lim c = S ^ 

x—>oo 



lim 

«— »oo 



lim Y = 

lim,(/) = 5(/-i 



(60) 
(61) 



8. Application to MGs with S = 3 



The case S* = 3 is the simplest situation where all the complexities of having more than 
two strategies per agent can be studied, so we will deal with this in detail. Here, with 
persistence, one can do most of the nested integrations in ()53I54I55I56I59|) analytically. 
We give some of the basic identities that this involves in [Appendix C The result is 
most easily expressed in parametric form in terms of the auxiliary variable u = \J a/c. 
The fundamental order parameter equation ()53j) for the persistent correlations c then 
becomes 



u 



iC 1 ?/ r 

1 + (1-— )/(«)- -erf(-) 3 + erf( ^ 

3 „,M.r u . u _j_„2 
err( — ^) ^e 12 



+ 



2m2 



H ^erf(- 

2m2 ^2 



^2^3^ 



/37r 



+ 



erf(- 



u 



2m '2V3 



)e-^^ 



(62) 



with I{u) = 2 Jq^^Dx erf(a;/v^) (this latter integral we could unfortunately not do 
analytically, except for the limit /(oo) = 1/3). Solving equation (jHS)) for c gives c as 
a function of a; the result is shown and tested against simulation data in figure IHl (left 
panel). Similarly, equation (jH^ for the susceptibility x takes the explicit form 

3 



VX 



2a 



erf(-) 1 + erf( 



u 



- 21{u) 



(63) 



1 + rix 

It follows that the x = 00 phase transition occurs at a value etc that must be solved 
(numerically) from the following two coupled equations 

3 



a = - <!erf(-) 1 + erf( — ^ 
2I ^2v^^ 



2/(m) 



(64) 



a 

- = 1 + 



a 



u 



ierf(^)[3+erf(-^^, 



+ 



2m2 



erf(^) 



u 



e 2 



u 



H rerf(— ) erf(- 

2m2 ^2^L ^2^3' 



u 



377 



+ 



erf(- 



u 



'2v^ 



\ -i«2 

e 4 



(65) 



Upon solving these equations one finds that Uc ~ 0.824. After doing the integrals in our 
expressions (|55l56p for the fractions (pi of agents that play I of their three strategies one 
obtains the following explicit formulae (expressed once more in terms of m = \J oi/c, i.e. in 
terms of the solution of our previous equation for c), which indeed obey 01 + 02 + 03 = 1: 



3 n/^ \ ^ T / \ 

1 - - erf(-) + -I{n) 
5erf(|)[l-erf(^) 



(66) 
(67) 

(68) 
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Figure 9. Left: the predicted persistent correlations c = Mnir^oo C{t) (solid curve) 
for = 3, as a function of a = p/N, calculated for ergodic stationary states. It is shown 
together with corresponding data measured in numerical simulations of MGs without 
decision noise, for both biased initial conditions (random initial strategy valuations 
drawn from [—10,10], •) and for unbiased initial conditions (random initial strategy 
valuations drawn from [-10-*, 10"^], o). Simulation system size: N = 4097. Dashed: 
the phase transition point ac{3) « 0.824. Right: the fractions (pi of agents that play 
precisely i € {1,2,3} of their 5 = 3 strategies, as a function of a = p/N, calculated 
for ergodic stationary states. Note that (pi + (j>2 + 4>3 = ^- 
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Figure 10. The fractions (pi for 5 = 3 are shown together with corresponding 
data measured in numerical simulations of MGs without decision noise, for both 
biased initial conditions (random initial strategy valuations drawn from [—10,10], •) 
and for unbiased initial conditions (random initial strategy valuations drawn from 
[— 10~^, 10~^], o). Simulation system size: iV = 4097. Dashed: the phase transition 
point ac(3) w 0.824. 

These three expressions are shown as functions of a in figure IHl and tested against 
numerical simulations in figure HH For large a the agents tend to use all three strategies, 
upon reducing a one finds increasing numbers switching to the use of only one or only 
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two of their strategies. Again, as in the previous figures involving c, we observe a perfect 
agreement between theory and simulations in the regime where our theory applies, i.e. 
for a > ac- Furthermore, the value found for etc is consistent with the simulation data 
in that non-ergodicity (a dependence on initial conditions, i.e. on whether one chooses 
biased or unbiased strategy valuations at time zero) is indeed seen to set in at the 
predicted point. 

Finally, perhaps the most sensitive test of our S = 3 theory is to work out and 
validate our formula for the strategy frequency distribution. Upon doing the 
relevant integrals we find confirmed that g{f) = for / ^ [0, 1] (as it should), whereas 
for / G [0, 1] we arrive at 



Qif) = ^ e-"^(^-i)^[l - erf(^)J + ^ e-t-^/-irerf(|(l - /)) 



+ S{f)il<Pi + l<p2) + S{f-l)^<P, (69) 

One can understand this formula qualitatively. A strategy a is not played at all 
when either i = 1 and it is among the two non-selected strategies (this happens with 
probability |0i), or when i = 2 and it is the one non-selected strategy (this happens 
with probability |02)- A strategy a is played permanently if £ = 1 and it is the selected 
one (this happens with probability |0i). Hence the second line of fl69|) . The first line 
of (j69|) reflects a being played now and then (among other strategies), with non-trivial 
frequencies; here one observes the expected local maxima at / = | (corresponding to the 
'average' behaviour for i = 2) and at / = | (corresponding to the 'average' behaviour 
for i = 3). Also the prediction ()(j9|l agrees perfectly with numerical simulations, as 
shown in figure fTTl in the regime a > ac for which is is supposed to be correct. 

In summary, we may conclude that for S" = 3 all our theoretical predictions 
regarding stationary state order parameters, the location of the phase transition, and 
even quantities such as the strategy frequency distribution, make physical sense and 
find perfect confirmation in numerical simulations. 



9. Application to MGs with 5* = 4, 5 

We now turn to S* > 3. Although it is in principle still possible to proceed with the 
various integrals in ()53p54|55p56p59|l analytically, for S* > 3 this would become a very 
tedious and time-consuming exercise. We have here resorted to numerical evaluation; 
one does not expect qualitatively different physics to emerge compared to 5 = 3, and 
life is short. For reasons of brevity we have also restricted ourselves to the validation of 
the static observables c and (pi only. It should be mentioned that upon increasing the 
value of S, the accurate numerical evaluation (based on the Gauss- Legendre method) of 
the various nested integrals in our theoretical expressions becomes nontrivial in terms of 
CPU quite quickly; especially for small values of a. Finding the expressions for adS) as 
presented in section [3 for > 3 already involved a careful assessment of the scaling of 
these values with the parameters that control the numerical integration accuracy. One 
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Figure 11. Examples of the predicted strategy frequency distribution g{f) (thick 
curves), for 5 = 3 and different values of a = p/N, as calculated for ergodic stationary 
states. They are shown together with strategy frequency data measured in numerical 
simulations of MGs without decision noise (shown as histograms, averaged over three 
samples), for both biased initial conditions (random initial strategy valuations drawn 
from [—10, 10], left graphs) and for unbiased initial conditions (random initial strategy 
valuations drawn from [—10"'', lO"**], right graphs). Simulation system size: = 4097. 
Note that ac(3) ~ 0.824, so the top two graphs refer to the nonergodic regime, where 
the present theory is not supposed to apply. 



also finds that for increasing values of S the finite size effects in the non-ergodic regime 
a < ac{S) become more prominent. 

For S — A the results of evaluating numerically the theoretical predictions 
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Figure 12. Left: the predicted persistent correlations c = lim7-_+oo C{t) (solid curve) 
for 5* = 4, as a function of a = p/N, calculated for ergodic stationary states. It is shown 
together with corresponding data measured in numerical simulations of MGs without 
decision noise, for both biased initial conditions (random initial strategy valuations 
drawn from [—10,10], •) and for unbiased initial conditions (random initial strategy 
valuations drawn from [-10"'', lO"''], o). Simulation system size: N = 4097. Dashed: 
the phase transition point ac(4) ~ 1.324. Right: the fractions (pi of agents that play 
precisely £ S {1, 2, 3, 4} of their S = i strategies, as a function of a = p/N, calculated 
for ergodic stationary states. Note that (pi + (j)2 + (f>3 + (f>A ^ 1- 




54f55p56|l are shown and tested against simulation data (obtained for MGs without 
decision noise) in figures [T21 and [T31 For S = 5 they are shown and tested in figures 
El and El It is very satisfactory that once more we observe for S G {4,5} in all cases 
an excellent agreement between theory and simulation data, both with regards to the 
observables measured (viz. c and the fractions (pi), and in terms of the locations of the 
transition points ac(4) and ac(5). 

10. Stationary state fluctuations: volatility and predictability 

Once the static order parameters c and x are known, our formulae for the volatility 
a and the predictability measure H can be reduced to 



c 



a' = r'j:mc{t-t')R{t') ^=(1^^ 

As for S = 2, H can always be expressed fully in terms of persistent order parameters and 
vanishes at the phase transition point. The result is shown in figure ITBl for S G {3, 4, 5}. 
If in the volatility formula we separate the correlation function into a static and a 
non-persistent part, C(t) = c + C{t) with lim^^-too C'(t) = 0, we obtain 

= tttW + E (I + vG)-'(t)C{t - 0(1 + f,G)-\t') 

(-L + VX) u'>0 
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Figure 13. The fractions for 3 = 4 are sliown together with corresponding 
data measured in numerical simulations of MGs without decision noise, for both 
biased initial conditions (random initial strategy valuations drawn from [—10, 10], •) 
and for unbiased initial conditions (random initial strategy valuations drawn from 



[-10-^10- 
point q;c(4) 



], o). Simulation system size: N — 4097. Dashed: the phase transition 
« 1.324. 



+ + vG)-\t)C{t - t'){l + vG)-\t') (71) 

Expression ()7H) is still exact. The first term is recognized to be H. The other terms 
contain the short-time fluctuations, involving non-persistent dynamic order parameters. 
If one seeks a formula for a in terms static order parameters only, one must pay the price 
of approximation. To do this we generalize the two procedures described for the S = 2 
batch MG in e.g. fOl E] and Both rely on the ansatz, motivated by observations 
in simulations, that the response function decays to zero very slowly, subject to the 
constraint J2t G{t) = x- Upon putting e.g. G{t > 0) = — l)e~^* with z one 
finds for such kernels that lim2_+o + vG)it)Y = 0. Consequently we may write 



a' 



c 



E(I + vG)-\t)Cit - t'){l + fiG)-\t') (72) 
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Figure 14. Left: the predicted persistent correlations c = Muir-too C(t) (solid curve) 
for 5* = 5, as a function of a = p/N, calculated for ergodic stationary states. It is shown 
together with corresponding data measured in numerical simulations of MGs without 
decision noise, for both biased initial conditions (random initial strategy valuations 
drawn from [—10,10], •) and for unbiased initial conditions (random initial strategy 
valuations drawn from [-10-*, lO"''], o). Simulation system size: N = 4097. Dashed: 
the phase transition point ac{5) « 1.822. Right: the fractions (pi of agents that play 
precisely £ G {1, 2, 3, 4, 5} of their 5 = 5 strategies, as a function of a = p/N, calculated 
for ergodic stationary states. 



The most brutal approximation of C is to assume the correlations to decay to zero very 
fast, and simply put C{t) — > C{0)6to = (1 — c)Stt'- This leads to the formula 

Alternatively, we may write the average in the definition ()2H1 of the correlation function 
as a sum of contributions representing the possible sizes i of the set of active strategies: 

s 

C{t — t') = 5tt' + (1 — 5tt') X! 0^ X!(^a,m(v(t),z(t)'5a,m(v{t'),z{t'))^ active (74) 

i=l a. 

This is still exact, but now we approximate for t ^t' 

active ~ ^('^a,m(v(i),z(t))^ active ('^a,m(v(<'),z(t'))^ active 



This gives C{t — t') ^ 5tt' — c + (1 — 5u') Z]f=i 0^/^ and hence the approximation 



+ l-^0,/£ (75) 
i=i 

Formula (ffH|l is identical to what was already proposed for 5 = 2 in formula (ffH|) 
generalizes to arbitary values of S the 5 = 2 volatility approximation first published 
in ITT] . Our two approximation formulae have more or less opposite deficits, since 
(fTSj) fails to incorporate any transient contributions to the correlations (here C(t) =0 
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Figure 15. The fractions for 5 = 5 are sliown together with corresponding 
data measured in numerical simulations of MGs without decision noise, for both 
biased initial conditions (random initial strategy valuations drawn from [—10,10], •) 
and for unbiased initial conditions (random initial strategy valuations drawn from 
[— 10"'', 10~^], o). Simulation system size: N — 4097. Dashed: the phase transition 
point ac{5) w 1.822. 



for t 7^ 0), whereas (fTSj) incorporates too many (here hmj_,oo C'(t) = Z]f=i — c 7^ 0). 
For batch MGs one finds that is more accurate than (|73| . see e.g. figure IT71 



11. Dynamics for very small and very large a 

Solving our order parameters C and G from 1)211) for finite times is generally non-trivial 
due to the non-Markovian nature of the effective process ()19p. Only for extreme values 
of a, viz. for a — > and a —>■ 00, and for short times it can to some extent be done. 
The causality constraint Gu' = for t > t' allows us to calculate with little difficulty 
the first few time steps and gain a better understanding of the nature of the dynamics, 
especially with regard to the role of the initial conditions and the dependence on the 
control parameter a. Causality implies that {G^)tt' = for t — t' < n, so that for 
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Figure 16. The measure H = lim 
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the overall market bid predictability, for time-translation invariant states, as a function 
of the control parameter a = p/N for S G {3, 4, 5} (from top to bottom). Dashed: the 
corresponding phase transition points ac{S). 
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Figure 17. The two approximate volatility formulas <ta and as (lower and upper solid 
curves, respectively) as functions of q = p/N, for S G {3, 4, 5}, calculated in the ergodic 
regime a > ac- They are shown together with volatility data measured in numerical 
simulations of MGs without decision noise, for both biased initial conditions (random 
initial strategy valuations drawn from [—10, 10], •) and for unbiased initial conditions 
(random initial strategy valuations drawn from [—10"'', 10"^], o). Simulation system 
size: N = 4097. Dashed: the corresponding phase transition points ac[S). 



t>t' >0: 



t-t' 



(76) 



n=0 
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% E E E E i-vr^'''iGnUG-'),,Cs, (77) 

V s=0s'=0n=0n'=0 



For short times all these expressions reduce to simple finite sums; for instance: 



-Roo = -Rii = ^00 = v'^ 

Rio = —fj^Gio, Sio = fj'^iCio — Gio) 

R21 = ~v'G2i, Sn = 77' - 2fG,oC,o + v\G,of 

20 



R2o = fG2iGio-fG: 



One can now in the familiar manner solve the effective single agent agent process for the 
first few time steps. The resulting expressions would be fully exact (for any a, including 
those in the nonergodic regime a < adS)), but increasingly involved. 

For very large and very small a matters become relatively simple, but the result is 
still quite informative. We will discuss only the deterministic case; adding decision noise 
will only introduce transparent stochastic variations of the behaviour described below. 
For large a and finite times the single agent process tells us that G = 0{l/fiy/a), that 
the effective Gaussian noise scales as rjait) = 0{1), and that 

t = : Va{l) = -a^a.argmax^KCO)] + 0{^/a) (78) 
t>0: Va{t + 1) = Va{t) - a5a,argmaXi,K(t)] + Cl{^/a) (79) 

Let us assume that Va{0) = 0{a^) for all a, and exclude the pathological case where 
two or more initial valuations are identical. According to (fTSj) . the strategy with the 
largest initial valuation is selected at time t = 1, but will then move to the back of the 
list of ordered valuations. At the next step the second largest is selected, and then, 
following (f?^ . also moved to the back of the ordered list, etc. The net result is that 
all valuations Va{t) will become increasingly negative (proportional to a), growing on 
average linearly with time, and that the effective agent will continually alternate his S 
strategies in a fixed order, being the order in which the valuations are ranked initially. 
If, for example, Wi(0) > ^2(0) > . . . > vs{0), then argmax^[t>6(t)] =t+l mod S, and for 
t > the effective agent equation gives 

t~i 

Va{t) = - « E "^a.t'+l mod S + 0{t^/a) 
t'=0 

= - a int[ ^ ] + 0{t^) (80) 

where int[2;] denotes the largest integer m such that z > m. This is the generalization 
to 5* > 2 of the period-2 oscillations known to occur in MGs with 5 = 2. For arbitrary 
valuation initializations v(0) = vq the above solution generalizes to 

Va{t) = - a int [ S + t + l -n,,{a) l ^ ^^^^^ (g^^ 

o 

with TTv denoting that permutation of {1, ... , S} for which t>^(i) > t>^(2) > . . . > Vt,(^s)- If 
Vq is drawn randomly from some finite- width distribution, corresponding to the situation 
where the agents in the original A^-agent system are initialized non-identically, all agents 
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would still continually alternate their strategies in a fixed order, but the orders would 
now generally be different for different agents. The dynamic order parameters would in 
either case be Cu' = 5t,t'mod s + 0[pr^l'^) and Gtv = 0[pr^l'^). 

Let us finally turn to small a. Here the effective process (II 9^ describes only small 
valuation changes at each time step, and one consequently finds that Ctv = 1 + 
and Git' = 0{y/a). The effective Gaussian noise in (fT^ is static in leading order in a, 
{Va(t)Vb(t')) = v'^^ab + 0{y/a), and in the absence of perturbation fields equation (fT^ 
gives simply 



where the Za are independent frozen random Gaussian variables, with zero average and 
unit variance. The systems remains static for a period of order t ~ A/fjy/a, where 
A indicates the magnitude of the initial valuation differences. A full analysis of the 
solution of our effective agent equations following this transient stage is in the small a 
regime a highly nontrivial exercise, which (to our knowledge) even for S = 2 has not 
yet been carried out, and would merit a full and extensive study in itself. 

12. Discussion 

In this paper we have shown how the generating functional analysis theory of minority 
games, developed in full initially only for S = 2, can be generalized to MGs with 
arbitrary values of 5*. The key obstacle in this generalization turned out not to be 
the derivation of closed equations for dynamic order parameters (via a generalized 
effective single agent process) but rather the solution of these equations in time- 
translation invariant stationary states. In previous studies closure of persistent order 
parameter equations could not yet be achieved analytically, and equations had to be 
closed artificially with the help of simulation data [El IE] • At a technical level the basic 
problem was the calculation of the strategy selection frequencies of the effective agent. 
This problem has now been solved, resulting in exact and explicit closed equations for 
persistent order parameters and for phase transition points, for any value of S. 

In our applications of the resulting theory we have mainly concentrated on the 
simplest nontrivial case 5 = 3, complemented by further applications to 5 = 4 
and 5 = 5. In all cases, the predictions of our theory in time translation invariant 
stationary states without anomalous response were shown to agree perfectly with 
numerical simulation data, including sensitive measures such as the strategy frequency 
distribution. We have not been able to solve our order parameter equations in all 
possible situations, however, those regimes where we could not proceed to full solution 
(e.g. calculating stationary states in the regime a < adS), and non-persistent order 
parameters at arbitrary times) are the same as those which also for the simpler case 
S = 2 have so far resisted the efforts of statistical mechanicists. Put differently, our 
objective and contribution here has been to raise the solvability of MGs with arbitrary 
values of 5* to the same level as that of MGs with S = 2. 
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It will clear that several further applications, developments and generalizations 
of the theory could now be taken up. One could for instance explore in more detail 
the effects of decision noise on MGs with S > 2, for which we have generated the 
required mathematical tools but for which we have not worked out the full consequences 
(such as the often counter-intuitive impact on the volatility, or the phase diagrams for 
multiplicative noise in the (a,T) plane). Alternatively, one could develop an S* > 2 
generating functional analysis for the so-called fake history on-line MGs jSj, where 
valuation updates are made after each randomly drawn sample of the global information. 
Probably the most interesting and nontrivial next step, however, would be to investigate 
the structure and the stationary state solutions of an 5* > 2 theory for MGs where the 
global information is no longer drawn randomly but represents the actual global history 
of the market, by generalization of [I^ . 



Note 

While finishing this paper we were made aware of another study in progress, aiming 
also to solve the strategy frequency problem for minority games with more than two 
strategies per agent, but in the context of multi-asset MGs and using a somewhat 
different approach (which one must ultimately expect to be mathematically equivalent). 
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Extremization of the exponent \1' + $ + as defined by p(ipi7pi8j) . with respect to the 
dynamic order parameters {C, C, K, K, L, L} gives the following saddle-point equations: 



m{v{t),z(t))Sa,m{v(t'),z(t'))) -k 



Ktt' = E(^. 



a,m(v{i),z,{t)) 



Y.^Va{t)Ut'))^ 



Vait% 



Ctt' = i 



9$ 



Kw = i 



(9$ 



Ltt' = i 



(9$ 



dCtt' dKtt' dLtt' 

with the abbreviation (/(v,v,z))^ = limAr_^oo Z]i(/(v, v, z)).j, where 



(/(v,v,z))i 



^i(v,v,z) 



/ 


•p-r dVa(t)dVa(t) 

I lat 2n 


(/(v, v,z)Fi(v, V,Z))2; 


/ 


T-r dVa (t)dVa (t) 

1 lat 2n 


(F,(v, v,z))2 



(A.l) 
(A.2) 
(A.3) 

(A.4) 

(A.5) 
(A.6) 



X e 



'^'^J2aU' ['^tt'<5a,m(v{t),z(t))'5a,m{v(t'),z(t'))+-^tt'^'ti(t)«a{t') + -'^tt"5a,m{v(t),z(t))^'»(*')] 



Via and the identity Z[0] = 1 one confirms as usual, for the physical saddle-point, 

that the Ctt' are indeed the correlations in Q, that Ltt' = 0, and that Ktt' = iCtt'- 
Putting ■j/' — >• (they are no longer needed) and choosing 6ia = 9a (site-independent 
perturbations) eliminates the dependence of ()A.6|) on i: Fj(v,v,z) = F(v,v,z). Next, 
to evaluate equations ()A.4jl we need to work out the function $ (fTTj) for small {Ltf}- 
Upon eliminating K via K = iG, and with the short-hands I for the identity matrix 
and {A^)tt' = ^t't, we find 



$ = a lo£ 



n 



dxtdxt 



271 



e 2 



I J2tt' '^^^w'^t'+'^'nY.tt' ^''^tt'^t' 



l-T^fY.^tLtt'Xt' + 0{L^ 



^alog det[(I + r/Gt)(I + 77G') 



2 

- laf E Ltt' [(I + fiG)-^C{\ + 77Gt)-i] + 0{L^ 
tt' 



For L = the three saddle-point equations ()A.4j) now become 
Ctt' = Ktt' = -av{l+f]G^)u} 
Ltt'= -harj^lil + fiGr'Cil + fiG^)-' 



(A.7) 

(A.8) 
(A.9) 



Upon inserting these expressions into ()A.6j) . and using causality (viz. Gtf = for 
t < t') one can now prove that the denominator of ()A.5|) equals one. This, in turn, 
implies that {g{-v,z)va{t))^, = id{g{'v,z))^,/d9a. We are then in a position to integrate 
out all occurrences of the conjugate integration variables {va}, and end up with the 
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(A.IO) 
(A.ll) 

(A.12) 



X n K(^) ~ ~1= {va{t + ^)-Va{t)-6a{t)+aflY^{l + flG)j5a,ra{w{t')At')) 
at v'^ t' 



X e 



WO)) /[n 

at 



'271 



Ett' r)a{tm+riG'<)C-\l+fiG)],,,na{t') 



<let—^[{l+fjG^)C-\l+fjG)] 



X n [^^a(t+l)-^^a(t)-^aW+«^5](I + ^G')-M (A.13) 

We recognize the above measure to represent the statistics of an effective single agent 
process, with dynamics defined as 

Va{t + 1) = Va{t) + 9a{t) - Otfj^ Ru>5a^rn{w{t'),x{t')) + V" Va{t) (A. 14) 

t' 

Here Ru' = r/(I + fiG)^J, and r7a(^) is a Gaussian noise characterized by the moments 
iVait)) = and iVaitHit')) = 6abiRGR%,. 



Appendix B. The volatility matrix 

Here we outline briefly how the generating functional (j^^ for overall bid fluctuations 



can be calculated via simple modifications of the generating functional Z[il)] defined in 
(jSI). Comparison with (jH)) shows that in the latter we should replace 



a,m{vj{t),Zj{t)) 



e 



(t),^i{t)) 



This is found to imply making the replacement a;f — + $^(t) in the disorder average, 
and ultimately has an effect only on the exponent $ of the saddle-point problem (leaving 
and Q unaffected). The latter will now become 



N 



n 



271 



tt' 



X e 
1 



-a log det[{l + f]G^){l + fiG) 



I -It. {Lw + ^RCR%, + 0{L^ 0^) 



7]^ 



(B.l) 
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We can now calculate from Z[(j)\ the quantities of interest. Upon emphasizing the 
dependence of flB.l|) on the fields </>, and using the normalization Z[0] = 1, we obtain: 



hm {A^{t)) 



hm {A,it)A,{t')) 



i lim lim — 



— i lim lim „ , , , e 



^ ^iv[*(0)-*(O) 



saddle 



— i lim 




d 



1^ (RCR^), 



rf 



(B.2) 



d^z[ct)] 



— lim lim „ , , ^ 

d(t)^{t)d(t),{t') 



lim lim , , " , , gJV[*(0)-*{O)] 
92 



saddle 



— lim - , , , , , 
^^0 d(j)^{t)(j)^{t j ^ 



r'6^,{RCR^) 



tt' 



(B.3) 



Appendix C. Integration identities 

Here we simply list (without proof) some of the basic identities that one uses in doing 
the various integrals in the 5 = 3 theory analytically, for the benefit of the reader: 

= lerf(^) - e-i"^ (C.l) 

jDxevi{A + Bx) = erf( ^^_A_ ) (C.2) 

(dxx erf(A + Bx) = , 6-^2/(1+25^) (c.3) 

^ v/7r(l + 252) 



y erf2(Ex) = - arctan (Vl + 452) - 1 (C.4) 
f 4 

jDxeTf{A + Bx) = - arctan (Vl + 4^2) - 1 



+ ^ / rfx e-^ erf( ^ J (C.5) 



Appendix D. The strategy frequency problem for multiplicative noise 

Here we discuss briefly the solution of the strategy frequency problem for multiplicative 
decision noise: m(v,z) = r argmax^[va] + (1 — r)b, with random r G {0,1} and 
b G {1, ...,5"}. The deterministic case corresponds to P{r) = 5^,1. Again we define 
T7*(x) = maxfjVbi'x) and the set A(x) via (jHH). Upon abbreviating A = J2rP{^)''^ ^ [O5 1]; 
the solution now proceeds as follows 
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• Since x) = s[tJa(x) + e^ls, x)] with linis^oo ^^al*, x) = we may write the 
second equation in as 

/a(x) = hm |A 5a,argmax,[U6(x)+e6(s,x)] + (1 " 

This second term can be nonzero only for a G A(x). Hence, once we know A(x) 
and v*{x.) the problem is solved: 

a^A(x): /,(x) = (l-A)/5, y^^^) = ax,^^a\[^-^) (D.l) 

V a D 

aGA(x): /,(x)=x,./^-^, U„(x)=r(x) (D.2) 

V a 

• The value T7*(x) is again calculated from J2a fa{^) = 1, but now also strategies 
a ^ A(x) are involved. We abbreviate |A(x)|~^ J2b€A{x) Ub = {U)a{x.), and sum over 
the indices in both ()D.1|) and ()D.2|) to get 

equations ()D.1ID.2|) thus become 

a^A(x): /,(x) = (l-A)/5, Va{x) = axA^a^f^-^) (D.4) 

V a D 

aG A(x) : /,(x) = + + ^ (x)a(x)) , M^) = 

• What remains is to determine the set A(x). The definition (jHTj) of A(x) demands 
that Va{x) < v*{x) for all a ^ A(x), i.e. 

a ^ A(x) : Xa < (x)a(x) - ^/f ^^'^"^ 
Demanding that expression ()D.5j) obeys /a(x) G [0, 1] gives, similarly 

a G A(x) : X, > (x)a(.) - /f + ^) (D-7) 

fa / X 1 — A\ fa ,^ ^, 

..<{.W,-^-(^ + — ) + ^- (D^8) 

So A(x) again contains the indices of the i largest components of the x, where 
i = |A(x)|. For each x we define the permutation tTx : {1, . . . , S'} — > {1, . . . , 5} for 
which these components will be ordered according to ^^^(i) > Xt,(^2) > ■ ■ > Xt,(^s), so 
that our three inequalities can be written as 

e-i 



^E^^M-^/f(A + (l-A)|) if^>l (D.9) 

1 ^ 1 fOi 

<jYl ^^{ra) - A if £ < S" (D.IO) 



m=l 

^-(1) <^E^-M-^/f(A + (l-A)|) + y| (D.ll) 
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Similar to the case of additive noise we conclude that i is the smallest number in 
{1, . . . , 5} for which (lDlO|) holds (if any), whereas if (lDlO|l never holds then i = S. 
We thereby satisfy also ()D.9|1 (compared to additive noise we here even satisfy a 
stronger condition). The proof that also the third condition ()D.11|) will be satisfied 
is virtually identical to that given for additive noise, so need not be repeated here. 
Since ()D.9|) and ()D.10|) are no longer mutually exclusive for A < 1, it is no longer 
immediately obvious that the solution is unique, but this can probably be proven. 

From this point onwards one can translate the solution found into expressions for 
averages of observables. This will once more involve a sum over all possible permutations 
(from which the component ordering permutation tTx is selected via a product of step 
functions) and further step functions to implement the inequalities on the components 
of X relative to the average over the strategies in the set A(x) that define this active set. 



